Code-switched English Pronunciation Modeling for Swahili Spoken Term Detection

نویسندگان

  • Neil Kleynhans
  • William Hartmann
  • Daniel R. van Niekerk
  • Charl Johannes van Heerden
  • Richard M. Schwartz
  • Stavros Tsakalidis
  • Marelie H. Davel
چکیده

We investigate modeling strategies for English code-switched words as found in a Swahili spoken term detection system. Code switching, where speakers switch language in a conversation, occurs frequently in multilingual environments, and typically deteriorates STD performance. Analysis is performed in the context of the IARPA Babel program which focuses on rapid STD system development for under-resourced languages. Our results show that approaches that specifically target the modeling of code-switched words, significantly improve the detection performance of these words.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Developments of Swahili resources for an automatic speech recognition system

This article describes our efforts to provide ASR resources for Swahili, a Bantu language spoken in a wide area of East Africa. We start with an introduction on the language situation, both at linguistic and digital level. Then, we report the selected strategies to develop a text corpus, a pronunciation dictionary and a speech corpus for this under-resourced language. We explore methodologies a...

متن کامل

Swahili Text-to-speech System

Text-to-speech (TTS) applications have been applied in diverse areas all over the world. Considering the fact that Swahili pronunciation is not complicated, and the language spoken by about 45 – 100 million people as their first or second language,, we considered the feasibility, and developed a Swahili Text-to-Speech (TTS) system. This paper gives an account of the Swahili TTS system developed...

متن کامل

Spoken Word Recognition of Code-Switched Words by Chinese-English Bilinguals

Two experiments with Chinese–English bilinguals were conducted to examine the recognition of code-switched words in speech. In Experiment 1, listeners were asked to identify a codeswitched word in a sentence on the basis of increasing fragments of the word. In Experiment 2, listeners repeated the code-switched word following a predesignated point upon hearing the sentence. Converging evidence f...

متن کامل

English Pronunciation Instruction: A Literature Review

English pronunciation instruction is difficult for some reasons. Teachers are left without clear guidelines and are faced with contradictory practices for pronunciation instruction. There is no well-established systematic method of deciding what to teach, when, and how to do it. As a result of these problems, pronunciation instruction is less important and teachers are not very comfortable in t...

متن کامل

Recognition and Verification of Engl for Computer-assisted Languag

We address methods for recognizing English spoken by Japanese students as the basis for our Computer-Assisted Language Learning (CALL) system. For automatic phonemic error detection, pronunciation error prediction is executed for a given orthographic text. To improve reliability, speaker adaptation and segment-input pair-wise verification are applied as pre-processing and post-processing, respe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016